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The present invention relates to a method of deter- 



mining the copy number of a nucleotide sequence I in a sample 
using an amplification technique, said method comprising the 
steps of 

5 1) adding nucleotides, primers, polymerase and any further 

reagents, if any, required for the amplification technique 
used to the sample, 
2) performing one or more amplification cycles to amplify the 
nucleotide sequence I for which the copy number has to be 
10 determined; 

where the sample contains a chromosomal second nucleotide se- 
quence II, and 

a) the first nucleotide sequence I is amplified, 

b) the second nucleotide sequence II is amplified, 

15 c) a third nucleotide sequence I ' corresponding to the first 
nucleotide sequence I and present in a control sample is 
amplified at various dilutions, and 
d) a fourth nucleotide sequence II' corresponding to the sec- 
ond nucleotide sequence II and present in a control sample 
20 is amplified at various dilutions, 

where the ratio of the concentrations of nucleotide sequence 
I ' and II' is known 

where the amplifications of the third and fourth nucleotide 
sequences I' and II' at various dilutions allows standard 
25 curves SCi with i being I or II to be made, the concentrations 
of I and II are determined by using the respective standard 
curve SCi , and the relative concentrations allows the rela- 
tive copy number CN of sequence I (versus nucleotide sequence 
II) to be determined using the formula 
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CN is the relative copy number of I over II in the sample; 
tIDsci. is the concentration of I determined using standard 
curve SCi • ; and 

1) [IIJscxx. is the concentration of II determined using stan- 
dard curve SCri' . 

Most eukaryotic diploid cells contain two copies of 
a single gene; one on each chromosome of a pair of chromo- 
somes. The chromosomes of a pair of chromosomes being derived 
from each parent, the genes may be different and, for exam- 
ple, one of them may result in a abnormal protein. Thus, the 
number of functional genes is not necessarily 2 in an eukary- 
ote, and can be 1 or even 0. While often genes are present in 
one copy per chromosome of a particular pair of chromosomes, 
some genes are present in multiple copies, for example in 
15 tandem repeat sequences. Another exception to the general 
rule of 2 copies per eel is mitochondrial DNA. A cell con- 
tains many mitochondria, the number being dependant on the 
type of cell. But even for a particular cell type, the number 
of mitochondria may vary. Typical numbers are between 100 and . 
20 1000 mitochondria per cell, and each mitochondrion contains 
several copies of mitochondrial DNA. In addition, the typical 
copy number is not necessarily equal to larger than 2 per 
cell. Some nucleotide sequences are very rare among cells 
(despite being of one and the same subject, such as a human . 
25 being). This is, for example, after gene rearrangement. This 
is, for example, the case with antibody producing cells (B- 
lymphocytes) or receptor-carrying T-lymphocytes . Of a large 
number of lymphocytes, only a few will contain a particular 
nucleotide sequence defining the variable region of a par- 
30 ticular antibody (or of the T-cell receptor) , capable of rec- 
ognizing a particular antigen, in the art, a need exists to 
reliably determine the copy number of a nucleotide sequence, 
which may comprise the nucleotide sequence of a gene or part 
thereof. A method according to the prearoHe is known in the 
35 art. 1 

A method according to the preaml le is known dis- 
closed by Kwok et al in US 5,389,512. 

The object of the present invention is to improve 
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this method for reliably determining the copy number of a nu- 
cleotide sequence even if it is present in extreme amounts, 
such as lots of copies per cell or only few copies per many 
cells. In addition, an object of the present invention is to 
5 provide a method which has reduced sensitivity to the effi- 
ciency with which DNA was extracted from the cells containing 
a nucleotide sequence I for which the copy number has to be 
determined . 

To this end, the method according to the present in- 
10 vent ion is characterized in that 

at least one pair of amplification reactions chosen 
from i) a) and b) , and ii) c) and d) is performed in a single 
container and monitored spectrophotometrically during ampli- 
fication, and 

15 the t 115 - 3 ^ nucleotide sequence I ' and fourth nucleo- 

tide sequence II' resides on a single vector. 

This allows for a more accurate measurement of rela- 
tive or absolute copy numbers of nucleotide sequence I. Suit- 
able spectrophotometrical methods are known in the art. More 
20 specifically, such methods rely on internal probes for real 

time measurements, for example real time PCR. Internal probes! 
are known in the art, and are disclosed by, for example, Wi- 
ner et al {Anal. Biochem 270, pp. 41-49 (1999)). Measurements 
can be done either continuously, or after finishing an ampli- 
25 fication cycle. While specific reference is made to standard 
curves, it goes without saying that this can be done using 
computational methods without an actual graph being made. 
Hence, in the present application the phrase "making a stan- 
dard curve" involves any method using at least two reference 
points to determine a (relative) concentration. Generally, 
all amplifications will be performed substantially at the' 
same time. By performing multiple amplifications in one con- 
tainer, the room for error is reduced. The method according 
to the invention is not only highly accurate, but it is also 
35 very efficient if performed for multiple samples. That is, 

for each nucleotide sequence I for which it is desired to de- 
termine the copy number, only a single standard curve SC Ia . 
has to be made. With respect to the term "corresponding" as 
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used in the present invention in conjunction with nucleotide 
sequences, this is intended to mean that the nucleotide se- 
quences I and 1 1 (and II and II'), or more specifically the 
nucleotide sequence of one and the complementary sequence of 
the other, are capable of hybridizing under stringent condi- 
tions. If the sequences I and I • (and II and II • ) do not have 
the same length, the shortest of the two is preferably at 
most 50% shorter, more preferably at most 30% shorter. 

The third nucleotide sequence I' and fourth nucleo- 
tide sequence II' residing on the same vector allows their 
ratio to be constant and exactly known (for example 1:1). 
This allows for the most accurate measurements possible, 
it is possible to subject the vector containing both nucleo- 
tide sequence I' and II' to a digestion using one or more re- 
striction enzymes, optionally followed by purification, to 
yield a linear molecule containing both both nucleotide se- 
quence I ' and II', and using this molecule for the amplifica- 
tions required for the standard curves. 

In the present application, a vector is understood 
to be any nucleotide sequence consisting of or containing the 
nucleotide sequence (s) to be amplified. When present on a 
vector capable of being replicated in vitro or in vivo, it is 
easy to obtain that particular nucleotide sequence in desired 
quantities. It is also very easy to determine the DNA concen- 
tration and hence the copy number of the nucleotide sequence 
per volume. A vector capable of replication or being repli- 
cated may be any such vector known in the art, such as a 
Plasmid, a cosmid, a virus etc. If, according to a favourable 
embodiment, the third nucleotide sequence I' resides on first 
vector and the fourth nucleotide sequence II' resides on a 
second vector, the vectors can be used (or mixed) at any de- 
sired ratio to accommodate expected differences in copy num- 
ber in the sample. 

Douek et al (Nature 396, pp. 690-695 (1998)) de- 
scribe a method for detecting the products of the rearrange- 
ments of T-cell receptors (TREC) using a semi-quantitative 
assay. For determining the amount of TREC in a given sample 
a known amount of a DNA competitor are prepared. Then, an 



AMENDED SHEET 



amount of sample DNA containing the nucleotide sequence to be 
determined are added to the tube. A PCR amplification reac- 
tion is carried out in the presence of radiolabeled deoxynu- 
cleotide. Subsequently, the resulting amplification products 
are run on a gel to separate the sample DNA PCR product from 
the competitor DNA product. After autoradiography, the amount 
of nucleotide sequence to be determined is calculated using 
densitometric analysis from the ratio between a band of com- 
petitor DNA and a band of the sample DNA. The result is ex- 
pressed as the number of copies of trec per microgram total 
DNA. To achieve an acceptable accuracy, 4 tubes containing 
scalar amounts of competitor DNA are used, to which fixed 
amounts of sample DNA are added. The disadvantage of this 
method is that when DNA is extracted from cells, it must be 
assumed that this is all the DNA present in the cells. That 
is, it is assumed that no cell escaped lysis and all DNA pre- 
sent in the cells was extracted and isolated. This is not 
necessarily the case. Another disadvantage of this method is 
that it is sensitive to differences in amplification effi- 
ciency. 

The European patent publication EP 0 959 140 dis- 
closes a method and apparatus for determining quantities of 
nucleic acid sequences in samples using standard curves and 
amplification ratio estimates. A plurality of standard sam- 
ples each containing a known quantity of a nucleic acid con- 
trol sequence, and a test sample containing a known quantity 
of the nucleic acid control sequence plus a nucleic acid se- 
quence in an unknown concentration, are subjected to an am- 
plification reaction. The concentration of the nucleic acid 
present in an unknown concentration in the test sample is de- 
termined. 

The European patent publication EP 1 13 8 7 83 dis- 
closes a method for the quantification of a nucleic acid in a 
test sample, by determining the amplification efficiency un- 
der defined conditions, and performing the quantification of 
said nucleic acid under the same defined conditions, allowing 
correction of the concentration determined for said nucleic 
acid. 
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According to a preferred embodiment the absolute 
copy number is determined by multiplying the copy number CN 
by the absolute copy number of sequence II per cell. 

For several nucleotide sequences II the number of 
copies of per cell is known. An example is, for example, the 
gene coding for heat shock protein 70, or Fas Ligand (CD178), 
which are known to be present with two copies per cell (i.e. 
the absolute copy number of hsp 70 = 2). Many nucleotide se- 
quences of genes are very suitable because they generally are 
present in a known number of copies in every cell of the spe- 
cies from which the DNA is derived. The efficiency with which 
DNA material is extracted from the cells is not important 
(although, in case nucleotide sequence I is on a different 
molecule as nucleotide sequence II, it is important that they 
15 are extracted with the same efficiency) . Hence, this embodi- 
ment allows determination of the absolute copynumber of the 
nucleotide sequence I per cell. 

According to a preferred embodiment, at least two 
different third nucleotide sequences I' for measuring a cor- 
responding number of different first nucleotide sequences I 
reside on a single vector. 

In other words, a single vector, requiring its con- . 
centration to be determined only once, can carry multiple 
third nucleotide sequences I', which allows, for example, the 
25 copy numbers of many different genes to be determined. 

Preferrably, the sequence of the first nucleotide 
sequence I is the same as the third nucleotide sequence I'. 

This strongly reduces errors due to differences in 
amplification efficiencies between I and I'. Nevertheless, 
3 0 small differences in nucleotide sequence are generally al- 
lowed, although changes at locations where the probe used for 
detecting the concentration of the nucleotide sequence are 
best avoided. In other word, it is highly preferred if the 
probe is a perfect match for the sequence where it is in- 
35 tended to bind. 

Similarly, it is preferred that the sequence of the 
second nucleotide sequence II is the same as the fourth nu- 
cleotide sequence II ' . 
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While the present invention is described with refer- 
ence to DNA, the present invention also applies to the deter- 
mination of the number of RNA sequences present in a cell. 
Use can be made of methods known in the art to multiply RNA, 
for example by preparing cDNA. This application does not at- 
tempt to teach an interested layman how to become a person 
skilled in the art, for which reason the layman is referred 
to general text books and in particular to a proper univer- 
sity to learn the required techniques that a person skilled 
in the art knows how to apply these techniques to work the 
present invention. 

The present invention will now be illustrated with 
reference to the drawings where 

Fig. 1 represents a standard curve for an mtDNA sequence 
15 I ' (circles) plus data for nucleotide sequence I (squares) ,- 

Pig. 2 represents a standard curve for a nuclear DNA se- 
quence II' (circles) plus data for nucleotide sequence II 
(squares) ; - 

Fig. 3 represents a standard curve for a nuclear DNA se- 
20 quence I- (circles) plus data for nucleotide sequence I 
(squares) ; 

Fig. 4 represents a standard curve for a nuclear DNA se- 
quence II- (FasL) (circles) plus data for nucleotide sequence 
II (squares); and 

Fig. 5 shows the effect of age on the numbers of copies 
of TREC in peripheral lymphocytes (percentage of lymphocyte 
cells expressing TREC) . 

The method according to the invention will be illus- 
trated using two Examples. The first relates to the quanti- 
tive analysis of mitochondrial dna (mtDNA) and demonstrates 
the technique for determining multiple copies per cell. The 
second Example demonstrates the quantitative determination of 
a fractional copy number of a particular nucleotide sequence 
per cell. 

35 EXAMPLE 1 

MATERIALS AND METHODS 
Primers 

The nucleotide sequence I (mtDNA) was a stretch 
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having a length of 102 nucleotides, and corresponds to part 
of the enzyme NADH dehydrogenase as coded for by mtDNA . Am- 
plification of nucleotide sequence I was performed using a 
set of primers, each having a length of 21 nucleotides and 
5 synthesized using standard procedures. The sequences of both 
primers were checked to be unique for human mtDNA using Blast 
software, through the NCBI site at NIH 
(http://www.ncbi.nlm.nih.gov/blast/) . 

The nucleotide sequence II (nuclear DNA) serving as 

10 a reference, was a stretch having a length of 104 nucleotides 
and part of the FasL gene, which comes with two copies per 
human cell . Amplification of nucleotide sequence II was per- 
formed using a set of primers, each having a length of 21 and 
24 nucleotides respectively. 

15 Probes 

To monitor the progress of amplification, a probe was 
used for nucleotide sequence I, the probe having a length of 
23 nucleotides, having a FAM (carboxy fluorescein) fluores- 
cent probe at the 5' end and a BlackHole Quencherl ™ group at 

20 the 3' end. This probe, and all others in this application, 
was ordered commercially with MWG, Ebersberg, Germany. The 
sequence of the probe was checked to be unique for human 
mtDNA using Blast software, through the NCBI site as men- 
tioned above. 

25 The probe used for nucleotide II had a length of 22 nu- 

cleotides and contained TexasRed as the fluorescent label and 
and a BlackHole Quencher2 ™ group at the 3' end (MWG) . 
DNA isolation 

DNA was isolated from HL60, a promyelocytic leukaemia 
30 cell line, using a DNA isolation kit from Qiagen, Hilden, 
Germany according to the instructions of the manufacturer. 
Control 

A vector was constructed, using pGEM-llZ (Promega) con- 
taining the sequences I' and II' head to tail, using standard 
35 genetic engineering techniques, as all too familiar from sam- 
brook et al . (Molecular cloning. A lab manual. (1989)) in E. 
coli. The nucleotide sequences I' and II' were identical to 
their respective I and II counterparts, and present on the 
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vector in a highly defined. 1:1 ratio. 

The absolute concentration of the controls was done us- 
ing limiting dilution assays (Sambrook) 

Amplification 

Amplification was performed using an iCycler Thermal cy- 
cler (BioRad, Hercules, CA, USA) using standard procedures . 
The amplification is performed in plates having 96 wells. 
This instrument allows monitoring of fluoresence in up to 4 
different channels. In short, one cycle of denaturation (95 
°C for 6 min) was performed, followed by 45 cycles of ampli- 
fication .(94 °C for 3 0 s, 60 °C for 60 s) . The amplification 
was performed in a mix that consisted of: Promega PCR buffer 
IX (Promega, Madison, WI, USA), 3.0 inM MgCl2, 400 pmol of 
primers for mtDNA, 0.2 m£! dNTP and 2 U of Taq polymerase 
(Promega) . In accordance with the invention, the amplifica- 
tion for both nucleotide sequences I and II were performed in 
a single well, and the same is true for nucleotide sequences 
I' and II* (for determining the standard curves) . Data were 
analysed using the software of the iCycler. 

The standard curves were made by introducing a known 
number of copies of vector per well. 

Amplification experiments were performed in triplicate. 
RESULTS 

Fig. 1 shows the standard curve for nucleotide sequence 
I' and Fig. 2 shows the standard curve for the nucleotide se- 
quence II' based on FasL. Note the excellent correlation co- 
efficients of 0.995 and 0.996 respectively, indicating the 
excellent accuracy of the method according to the invention. 
Using these curves, the concentration of nucleotide sequences 
I and II (shown as squares in Figs. 1 and 2) were determined. 
As it is known that the nucleotide sequence for FasL (and 
more specifically for the probe for nucleotide ll/ii') is 
present with two copies per cell, the number of copies of nu- 
cleotide sequence I per cell is twice as high, i.e. 160. 

EXAMPLE 2 

Basically, the same method was used as described in Example 
1, except that the nucleotide sequence I corresponded to part 
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of the sequence of the delta locus of the T-cell receptor. 
The method was used to determine the number of copies of TREC 
per cell, in particular peripheral lymphocytes in blood, in 
three age groups (healthy humans of 20, 60 or 100 years. The 
5 number of people were respectively 16 (10) , 17 (10) , and 21 
(17) , with the number of women between parentheses) 

The standard curves for nucleotide sequence I' and II' 
are shown in fig. 3 and 4 respectively. The following corre- 
lation coefficients obtained were: 0.999 and 0.998. 

10 Fig. 5 shows that the number of copies of TREC decreases 

with age (averages per age group shown as a horizontal line) 
from about 3.2 to 0.1 per 100 cells. 

While particularly beneficial for the method ac- 
cording to the present invention in view of the fact that 

15 spec tropho tome trical methods allow simulaneous detection of 
multiple labels, it is possible to perform an amplification 
reaction using any known amplification technique, where the 
third nucleotide sequence I ' and fourth nucleotide sequence 
II' resides on a single vector and the amplifications of each 

20 of I' and II' are performed in separate containers, such as 
separate wells. The application covers this possibility as 
well. Such amplification techniques comprise, apart from the 
ones mentioned above, CP (Cycling Probe Reaction) , bDNA 
(Branched DNA amplification) , SSR (Self -Sustained Sequence 

25 Replication) , SOA (Strand Displacement Amplification) , qbr 
(Q-Beta Replicase) , Re -AMP (Formerly RAMP) , NASBA (Nucleic 
Acid Sequence Based Amplification) , RCR (Repair Chain Reac- 
tion) , LCR (Ligase Chain Reaction) , TAS (Transorption Based 
Amplification System) , and HCS (amplifies ribosomal RNA) . 
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